SEQUENCE LISTING 



SEQ ID NO. 1 (WGA1), SEQ ID NO. 2 (WGA2), or SEQ ID NO. 3 (WGA3). 

WGA1 (A) cDNA sequence: 

atgaag atgatgagca ccagggccct cgcgctcggc
61 gcggctgccg tcctcgcctt cgccgcggcg accgctcagg cccagaggtg cggcgagcaa
121 ggcagcaaca tggagtgccc caacaacctc tgctgcagcc agtacggcta ctgcgggatg
181 ggcggcgact actgcggcaa gggctgccag aacggcgcct gctggaccag caagcgctgc
241 ggcagccagg ccggcggcgc gacgtgcacc aacaaccagt gctgcagcca gtacgggtac
301 tgcggcttcg gcgccgagta ctgcggcgcc ggctgccagg gcggcccctg ccgcgccgac
361 atcaagtgcg gcagccaggc cggcggcaag ctgtgcccga acaacctctg ctgcagccag
421 tggggattct gcggcctcgg ttccgagttc tgcggcggcg gctgccagag cggtgcttgc
481 agcaccgaca aaccgtgcgg caaggacgcc ggcggcagag tttgcactaa caactactgt
541 tgtagcaagt ggggatcctg tggcatcggc ccgggctatt gcggtgcagg ctgccagagt
601 ggcggctgcg atggtgtctt cgccgaggcc atcaccgcca actccactct tctccaagaa
661 tga



WGA1 (A) protein sequence:

MKMMSTRALALGAAAVLAFAAATAQAQRCGEQGSNMECPNNLCC
SQYGYCGMGGDYCGKGCQNGACWTSKRCGSQAGGATCTNNQCCSQYGYCGFGAEYCGA
GCQGGPCRADIKCGSQAGGKLCPNNLCCSQWGFCGLGSEFCGGGCQSGACSTDKPCGK

WGA2 (D) cDNA sequence: 

atgaga aagatgatga gcaccatggc ccttacgctc ggcgcggctg tcttcctcgc
cttcgccgcg gcgaccgcgc aggcccagag gtgcggcgag cagggcagca acatggagtg
ccccaacaac ctctgctgca gccagtacgg gtactgcggc atgggcggcg actactgcgg
caagggctgc cagaacggcg cctgctggac cagcaagcgc tgcggcagcc aggccggcgg
ggcgacgtgt cccaacaacc actgctgcag ccagtacggg cactgcggct tcggagccga
301 gtactgcggc gccggctgcc agggcggccc ctgccgcgcc gacatcaagt gcggcagcca
361 gtccggcggc aagctatgcc cgaacaacct ctgctgcagc cagtggggat tctgcggcct
421 aggttccgag ttctgcggcg gtggctgcca gagcggtgct tgcagcaccg acaagccgtg
481 cggcaaggac gccggcggca gggtttgcac taacaactac tgttgtagca agtggggatc
541 ctgtggcatc ggcccgggct attgcggtgc aggctgccag agcggcggct gtgacgctgt
601 ctttgccggc gccatcaccg ccaactccac tcttctcgca gaatga

WGA2 (D) protein sequence:

MRKMMSTMALTLGAAVFLAFAAATAQAQRCGEQGSNMECPNNLC
CSQYGYCGMGGDYCGKGCQNGACWTSKRCGSQAGGATCPNNHCCSQYGHCGFGAEYCG
AGCQGGPCRADIKCGSQSGGKLCPNNLCCSQWGFCGLGSEFCGGGCQSGACSTDKPCG




WGA3 (B) cDNA sequence: 

caaaggtgcg gcgagcaggg cagcggcatg gagtgcccca acaacctgtg ctgcagccag
61 tacggctact gcgggatggg cggcgattac tgcggcaagg gQtgccagaa cggcgcgtgc
121 tggaccagca agcggtgtgg cagccaggcc ggcggcaaga cgtgccccaa caaccactgc
181 tgcagccagt acgggcactg cggcttcggc gcggagtact gcggcgccgg ctgccagggc
241 ggcccctgcc gcgccgacat caagtgcggc agccaggccg gcggcaagct gtgccccaac
301 aacctctgct gcagccagtg ggggtactgc ggcctcggtt ccgagttctg cggcgagggc
361 tgccagaacg gcgcttgcag caccgacaag ccgtgtggca aggacgccgg cggcagggtt
421 tgcactaaca actactgctg tagcaagtgg ggatcctgtg gcatcggtcc cggctactgc
481 ggtgcaggct gccagagcgg cggctgcgat ggtgtcttcg ccgaggccat cgccaccaac
541 tccactcttc tcgcagaatg a

WGA3 (B) protein sequence: 

QRCGEQGSGMECPNNLCCSQYGYCGMGGDYCGKGCQNGACWTSK
RCGSQAGGKTCPNNHCCSQYGHCGFGAEYCGAGCQGGPCRADIKCGSQAGGKLCPNNL
CCSQWGYCGLGSEFCGEGCQNGACSTDKPCGKDAGGRVCTNNYCCSKWGSCGIGPGYC
GAGCQSGGCDGVFAEAIATNSTLLAE